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DATE- 07/09/2001 / 

RAW SEQUENCE LISTING ttmp in 10 43 / 

PATENT APPLICATION: US/09/869 , 142 TIME: 10.10.43 

Input Set : A:\PTO.txt W< 
Output Set: N:\CRF3\07092001\l869142.raw >i 

3 5 lilt t^LeT/iN— BACTERIA, NITRILASE GENE, NITRYL HYDRATASE 
GENE T AMI DAS E GENE FROM RHODOCOCCUS BACTERIUM, AND PROCESS FOR PRODUCING CARBOXYLIC 

7 ACIDS USING THEM 

9 <130> FILE REFERENCE : Q64574 IT£ , /rtQ/Q< - Q , i7 

C -> 11 <140> CURRENT APPLICATION NUMBER: US/09/869 , 142 
r— > 11 <141> CURRENT FILING DATE: 2001-06-26 

11 7l50> PRIOR APPLICATION NUMBER: USSN 60/183,754 
10 ^im>> PRTOR FILING DATE: 2000-02-22 
" <150> PRIOR ^PLICATION NUMBER: USSN 60/183,821 
1t; PPTOR FILING DATE: 2000-02-22 

< > PRIOR APPLICATION NUMBER : JPA 2000-107855 MTCPlEO 
18 <151> PRIOR FILING DATE: 2000-04-10 C Nj \ E ^ ^ 

20 <150> PRIOR APPLICATION NUMBER: JPA 2000-021797 

21 <151> PRIOR FILING DATE: 2000-01-26 

23 <150> PRIOR APPLICATION NUMBER: JPA 11-303212 

24 <151> PRIOR FILING DATE: 1999-10-26 
26 <160> NUMBER OF SEQ ID NOS: 7 

28 <170> SOFTWARE: Patentln version 3.1 

30 <210> SEQ ID NO: 1 

31 <211> LENGTH: 1531 

32 <212> TYPE: DNA 

33 <213> ORGANISM: Rhodococcus sp. 

35 <220> FEATURE: 

36 <221> NAME /KEY : exon 

37 <222> LOCATION: (324 ).. (1421) 

38 <223> OTHER INFORMATION : 

41 <400> SEQUENCE : 1 atacccqqqq atcgaaccag caacggggac 60 

42 agcttgacca tgattacgaa "ogagctcg £acccgggg * gaccaccacc 120 
44 gcacagtcga cgtagacctc gacctatccg ccgttccg y ™ gcgaagagcc 
46 acttcaacat ccttcaacgt gcccggccag = «tcg ^ 00^^ 
48 gcctcggacc ccccggccga accgctcga g ^ acctcgtact gtcc tgccaa 



180 
240 



48 gcctcggacc ccccggccga accgctcgat gaactc« ^ctc g tact gtcctgccaa 300 
50 caggacccgt gtcattccac 9tc«attcac g^ccttttc a cctcgt ^ ^ ^ 

52 acacaagcaa cggaggtacg gac atg gtc gaa tac ^ ^ ^ ^ ^ ^ 

53 5 10 

1 . „ a rn ntr. 401 



3.X.Q ULU uaa ua.^ " — — 

Met Val Glu Tyr Thr Asn Thr Phe Lys Val 

I K H S S K S E K S S C H K £ S « 
S C S S « £ S S S K E K 3 K S S S 

| £ !£ H s S K £ ss s s ss - s s = s 

S S !S C K S S K S S H S S S « S £ 
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DATE- 07/09/2001 

RAW SEQUENCE LISTING in- 10- 43 

PATENT APPLICATION: US/09/869 , 142 TIME. 10.10.43 

Input Set : A:\PTO.txt 

Output Set: N:\CRF3\07092001\I869142.raw 

fi c 70 
60 00 „ ^ r-xn r.aa tta etc 593 



]l cac I, .at tec ctg acg atg gac age ccg =J= Jt. -J g. £ 

73 His Glu Asn Ser Leu Thr Met Asp Ser Pro his gQ 

80 

?S St gee gee ego gac cac aac ate jee gt. gtg JtJ «. Jt. ago gag 
77 Asp Ma Ala Atg Asp His Asn II. Ala Val Val y ^ 

I s s s s s s s k i a s - £ 5 K s 

S 35 1= 2 S E S S S IS £ 3 S 1 = S H 
H S 2 S 5 S S 5 = - 5= S S S= !2 £ 5 

9° 140 145 „ tac taq qag cat ttc cag 

S K S - Sa Arg feu ST, S 2 Tsn g £ Sx2 His P h e Cl„ 

II acg etc aec aag tac Z at, tac tee ate cac gag cag gtg cac otc 

97 Thr Leu Thr Lys Tyr Ala Met Tyr Ser Met ^ 

Is; a = ss si s s 2 1 = - «• - i s £ 

£ S S S K S £ S K S S E S K £ K S 

106 205 . «.„ tnr arc acc caq gtg gtc aca ccg gag gec cac 

Z til S £. E? £ Z £ - Si! !.! Sal Thj Pro Cl« Ala His 

\\l gag III tte tec cae aac gag £• - cga ate tte ate ggc eg, gee 

113 Glu Phe Phe Cys Glu Asn Glu Glu Gin Arg Met l ^ 
gga ggt tte ,c, 0,0 ate ate ggg ccc gac ggc ceo gat etc g=a act 

117 Gly Gly Phe Ala Arg He lie Gly Pro asp y ^ 

118 255 „ n r,rr a fr etc tac qcc gac ate gat ctg 

120 cct etc gee gaa gat gag gag ggg ate etc ta 9 ^ ^ ^ 

121 Pro Leu Ala Glu Asp Glu Glu Gly lie beu y ^ 

\ll tct gcg ate ace ttg gcg aag «g gec get gac ccc gtg ggc eac tac 

125 Ser Ala He Thr Leu Ala Lys Gin Ala Ala Asp ^ 

126 285 cqc cgc a cc acg 

128 tea egg ccg gat gtg ctg teg ctg aac ttc aac cag ^ ^ ^ ^ 

129 Ser Arg Pro Asp Val Leu Ser Leu Asn Phe as ^ 

130 300 ... I I a nn *tc cat acc acg cac acg ttc gtg 

132 ccc gtc aac acc eca ctt tec acc ate cat gee g ^ ^ ^ 

133 Pro Val Asn Thr Pro Leu Ser Thr He Hxs ai 33Q 



134 315 
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DATE: 07/09/2001 
RAW SEQUENCE LISTING in- 10- 43 

PATENT APPLICATION: US/09/869,142 TIME. 10.10.43 

Input Set : A:\PTO.txt 

Output Set: N:\CRF3\07092001\l869142.raw 

13 6 ecg ca g ttc ggg gc. etc g ac gg c gtc egt gag etc aac gga gcg gae 1361 

137 Pro Gin Phe Gly Ala Leu Asp Gly Val Arg biu ^ 

138 335 ai -a rat tec qac ga g acg gac egg gcg 1409 

140 gaa cag cge gca ttg eec tee aca eat tec gae g g g 9 ^ 

141 Glu Gin Arg Ala Leu Pro Ser Thr His Ser Asp ^ 

! 4 44 aca gec aec etc tgacteggge geaeccgtgg egeeteegaa gegccacggg 
145 Thr Ala Thr Leu 

ut twwww*' ^""<" a 9tac=5a9Ct C9 "" C5ta i«i 

150 atcatggtca 

153 <210> SEQ ID NO: 2 

154 <211> LENGTH: 366 

155 <212> TYPE: PRT 

156 <213> ORGANISM: Rhodococcus sp. 

Ill t^XZ Jhr -n T h r Phe Lys Val Ma Ala Val Gin Ma Gin 
IS Pro Val Trp Phe Lp Ma Ala Lys Thr Val Asp Lys Thr Val Ser lie 
iS He Ala Glu Ma Ala Arg Asn Gly Cys Glu Leu Val Ma Phe Pro Glu 
! 7 6 2 Val Phe lie Pro Gly Tyr Pro Tyr His He Trp Val Asp Ser Pro Leu 
\]l Ala fly Met Ma Lys Phe Ma Val Arg Tyr His Glu Asn Ser Leu Thr 
III Met Asp Ser Pro His Val Gin Arg Leu Leu Asp Ma Ma Arg Asp His 

181 ti a 13 val Val Val Gly He Ser Glu Arg Asp Gly Gly Ser Leu 

184 Asn He Ala Val Val vai j- ^ n0 

185 i?° t tip Tie Asd Ala Asp Gly Gin Leu Val Ala Arg 

188 Tyr Met Thr Gin Leu He He Asp Aia asp y ^ 

189 . ?l tph Lvs Pro Thr His Val Glu Arg Ser Val Tyr Gly Glu 

192 Arg Arg Lys Leu Lys Pro nr n±& ^ 

193 130 T1 c v v a i Tvr Asd Met Pro Phe Ala Arg Leu 

196 Gly Asn Gly Ser Asp He Ser Val Tyr Asp Met ^ 

197 145 150 , „. D . rln Thr Leu Thr Lys Tyr Ala 
200 Gly Ala Leu Asn Cys Trp Glu His Phe Gin Thr ^ 

III Met Tyr Ser Met His Glu Gin val His Val Ma Ser Trp Pro Gly Met 
III ser ,eu Tyr Z Pro Glu Val Pro Ala Phe Gly Val Asp Ala Gl„ leu 

209 195 — rlll rlv Gln Thr Phe Val Val Cys 

212 Thr Ala Thr Arg Met Tyr Ala Leu Glu Gly Gin inr 

213 210 2 ^ , , . ... rlll ph ~ phe Cvs Glu Asn 

216 Thr Thr Gin Val Val Thr Pro Glu Ala His Glu Phe Cys ^ 

217 225 f ti- n„ Ara Glv Glv Gly Phe Ala Arg He 

220 Glu Glu Gin Arg Met Leu He Gly Arg Gly wy Y ^ 

221 ^ 5 » T pn Ala Thr Pro Leu Ala Glu Asp Glu 
224 He Gly Pro Asp Gly Arg Asp Leu Ala inr 



file://C:\Crf3\Outhold\VsrI869142.htra 



7/9/01 



Page 4 or / 



DATE- 07/09/2001 
RAW SEQUENCE LISTING °AiL. 
PATENT APPLICATION: US/09/869,142 TIME: 10:10.43 



Input set : A:\riw.tJi.*. 

Output Set: N:\CRF3\07092001\I869142.raw 

228 Glu Gly He HI Tyr Ala Asp lie IZ Leu Ser Ala lie Thr Leu Ala 

232 Lys Gin All Ala Asp Pro Val Gly His Tyr Ser Arg Pro As P Val Leu 

ocin 295 oUu 

236 Ser leu Asn Phe Asn Gin Arg Arg Thr Thr Pro Val Asn Thr Pro Leu 

III III Thr lie His Ala Thr His Thr Phe Val Pro Gin Phe Gly Al. Leu 

325 j~3U 
111 »p Gly Val Arg Glu Leu Asn Gly Ala Asp Glu Gin Arg Ala Leu Pro 

"I Ser Thr His s" Asp Glu Thr Asp Arg Al, Thr Al, Thr Leu 



252 <210> SEQ ID NO: 3 

253 <211> LENGTH: 2822 

254 <212> TYPE: DNA 

255 <213> ORGANISM: Rhodococcus sp. 

257 <220> FEATURE: 

258 <221> NAME /KEY : CDS 
o c q LOCATION: (1379) (2068 ) 

260 <223> OtIeR INFORMATION : nitrile hydratase beta subumt 

263 <220> FEATURE: 

264 <221> NAME /KEY: CDS 

III <llll ^—Vo^lTJn. hydratase alpha suhunit 

^ S^r-u. ~£ »s a — -g^og 
- «S SSSS =i -fj« SSSE 

2 ^ sis s» : t« nss - 

280 tacacgtga, tggacgatgc ctgggcgcta 9tcggatgtg "acccaccc ggcaactgtt 
282 cccgectacg cogaag.ceg 9»acc, ? c9t =9tccctg=c t^cgt^ « ^ 
284 gtgaacgccc 9.9C99=c« «=99=tc« cagttgg g 99 aacacctago 
286 goccacggcg ggacctacgc "cttcggcc gga gg g » tc c , ca gccca, 

2! = -f a „a o = ca 

% ss ras ss — - 

296 tctgacaatg ctgatcocct gccgccgccg "ggacgacc ^ag«g« 9 9 9 
298 gagcca.cc. «9gcatcat gcgato< , g 9 cctat^gg 9^9^^ 

^ gg«,gcgaa Socager ^acarcga . 99tgggag —arc ggagacgoa, 
304 acacccggag W""^ = =cc99,c= ££££ S££c4t >gtgtg=gg9 
306 agegcaageg attcaatctt gttacttcca g * ctcttttcga aogagaaccg 
308 gagagogece gaaogoaggg 'SKtcaa cgattgttgt gctgtgaagg 1260 

310 geeggtacag "aatccgga =«««gtga ^9"=.. 9 9 9 ccttctccct 13 20 

IS U£f.£ 9-^ga cS«"ga 9?c=agc?cc gatgaaagg, atgaggaa 13,8 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
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RAW SEQUENCE LISTING DATE: 07/09/2001 

PATENT APPLICATION: US/09/869, 142 TIME : 10 : 10 . 43 

Input Set : A:\PTO.txt 

Output Set: N:\CRF3\07092001\I869142.raw 

316 atg gat ggt ate cac gac aca ggc ggc atg acc gga tac gga ccg gtc 1426 
3" 2? Asp Gly He His Asp Thr Gly Gly Met Thr Gly Tyr Gly Pro Val 

(- Q J- ^ 

318 1 3 ^ _ „,„ 4-,„ t-™ nan oat caa 1474 



320 ccc tat cag aag gac gag ccc ttc ttc cac tac gag tgg gag ggt cga 
32? Pro Tyr Gin Lys Asp Glu Pro Phe Phe His Tyr Glu Trp Glu Gly Arg 
322 20 25 1522 



1666 



324 acc ctg teg att ctg acc tgg atg cat etc aag ggc atg teg tgg tgg 

325 Thr Leu Ser lie Leu Thr Trp Met His Leu Lys Gly Met Ser Trp Trp 
S?fi 35 40 

328 qac aag teg egg ttc ttc egg gag teg atg ggg aac gaa aac tac gtc 

329 Asp Lys Ser Arg Phe Phe Arg Glu Ser Met Gly Asn Glu Asn Tyr Val 

330 50 55 60 

332 aac qaq att cgc aac teg tac tac acc cac tgg ctg agt gcg gcg gaa 

333 Asn Til lie Arg Asn Ser Tyr Tyr Thr His Trp Leu Ser Ala Ala Glu 
fiS 70 75 

336 cat ate etc gtc gee gac aag ate ate acc gaa gaa gag cga aag cac 

337 Arg lie leu Val Ala Asp Lys He lie Thr Glu Glu Glu Arg Lys His 

340 cgc gtg cag gag ate etc gag ggt egg tac acg gac agg aac ccg teg 1714 

341 Arg Val Gin Glu lie Leu Glu Gly Arg Tyr Thr Asp Arg Asn Pro Ser 

100 105 

3 egg aag ttc gat ccg gee gag ate gag aag gcg ate gag agg ctt cac 
345 Arg Lys Phe Asp Pro Ala Glu lie Glu Lys Ala lie Glu Arg Leu 

348 qaq eee III tec eta gtg ctt III gga gcg gag ccg agt ttc tee etc 1810 

349 Glu Pro Ss Ser Leu Val Leu Pro Gly Ala Glu Pro Ser Phe Ser Leu 

352 ggt gac aag gtc aaa gtg aag aac atg aac ccg ctg gga cac aca egg 1858 

353 Gly Asp Lys Val Lys Val Lys Asn Met Asn Pro Leu Gly His Thr Arg 

1 i S »! £ !2 S £ S S S S £ !S S 2j K 1906 

S « * s - i - a s = b s ss s s s s 1954 

364 ccc cgc CC "= tac acc gte gcg ttt tec gee cag «« tgg «c 2002 

365 Pro Arg Pre Leu Tyr Thr Val Ala Phe Ser Ala Gin Glu Leu Trp Gly 

368 gac gac gga aac ggg aaa gac III gtg tgc gtc gat etc tgg gaa ccg 2050 

369 Asp Asp Gly Asn Gly Lys Asp Val Val Cys Val Asp Leu Trp Glu 

3" t,c erg ate tct ,c, tga aaggaatac, ata gt, age gag eje gtc a.t 2099 
373 Tyr Leu lie Ser Ala J'J 235 

376 aag tac acg gag tac gag gea cgt ace aag gca ate gaa acc ttg ctg 2147 

377 Lys Tyr Thr Glu Tyr Glu Ala Arg Thr Lys Ala He Glu Thr Leu Leu 

III tac gag cga ggg cic. ate aeg ccc gee gcg gtc gac cga gtc gtt teg 2195 
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Uk of n and/or Xaa has been detected in the Sequence Lining, 
iteciew vl\e Sequence Listing to insure a corresponding 
explanation is presented in die <220> to <223> fields Of 
each sequence using n or Xaa. 
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VERIFICATION SUMMARY DATE: 07/09/2001 

PATENT APPLICATION: US/09/869,142 TIME: 10:10:44 



Input Set : A:\PTO.txt 

Output Set: N:\CRF3\07092001\l869142.raw 

L:ll M:270 C: Current Application Number differs, Replaced Current Application No 
L:ll M:271 C: Current Filing Date differs, Replaced Current Filing Date 
L:607 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:6 
L:617 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:6 
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